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NXT Heparin Affinity Glyc.grad. 
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GJyc. grod. 
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FIGURE 9 

1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 

5 1 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 

101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 

1 5 1 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 

201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 

251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 

301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 

351 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 

401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 

45 1 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 

501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 

551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 

601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 

65 1 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 

701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 

75 1 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 

801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 

85 1 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 

901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 

95 1 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 

1 00 1 CTTGAGAAAG TC AAAGATTT TAACTTCAAC TACTATTTAA C AAAATCTTG 

1 05 1 TCCTCTTCC A GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 

1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 

1 20 1 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 

1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 

1 30 1 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 

1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 

1401 ACGTCTTATG G AAATTGCTC CGATGG ATAT TCGAGGATCT CGTCGTCTCG 

1 45 1 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 

1 50 1 AACCTATTAC TAC AGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 

1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 

1 601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 

1 65 1 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 

1 70 1 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 

1 75 1 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 

1 80 1 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 

1 85 1 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 

1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 

1 95 1 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 

2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 

205 1 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 

2101 TGCACTTG AA GG AGGACAAT ATCC AACCTT ATTCAGTGTT CTTGAAAATG 

2151 A AC A A A ATG A CTTAAATGCA AAG AA AACAT TAATTGTTGA AGCAAAGCAA 

220 1 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 

225 1 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 

2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
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FIGURE 9 (cont.) 



2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
245 1 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
2551 GACTAGTTTT CCATTAAGTC C AAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
265 1 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
275 1 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
285 1 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
295 1 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACC AGAGCAT TCTTTAAATA 
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
305 1 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTAC ATTT TCAAC AGAGT TTGC ATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT GACC AATGTC AATCTCTAAT TC AATATG AT GC ATAGTCGA 
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
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1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 
51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKJCQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNN1SAMD RAQTIFTN1F RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKJ ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKJNT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKN1WDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDDBKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KXTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
75 1 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYTFNRVCMI LKAKEAKLKS DQCQSLIQYD A 




FIGURE 11 

1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
35 1 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
50 1 GATCTCGATA CATC AGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 A AAAG AAGC A GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1 101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1 25 1 TC AC ATTCAT AG ATCGACCT TC ATATATCC AATACGATGA TAAGGAAACA 
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
1351 AAGAAATGG A GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 
1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG 




FIGURE 12 



CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 



GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTT 

a PQNPKTPKPL * K K K K L R - * F R 

b PKTPKPQNPYKKRKNCGSLE- 

c PKPQNPKTPI KKEKI E V V " K - 

AAT AAAAT ATT ATTC C CGC AC AAATGG AG ATGG AT ATTG ATTTGG ATG AT AT AG AAAATT 

61 «. + * + + + 120 

TT ATTTT AT AAT AAGGGCGTGTTT ACCTCT AC CT AT AAC T AAACCT ACT AT ATCTTTT AA 

a nkilfphkwrwiliw"'mi.*ki 

b IKYYSRTNGDGYCFGC 'V R K F - 

C * N I I PAQMEMDIDLDD I ENL- 

TACI^CCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + * ♦ * 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a YFLIHSTS IAALVVTRKDAK 

b TS*YIQQV*QLL* 'QE'RMQN- 

C LPNTFNKYSS SCS DKKOCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 
181 * + ^ + ^ 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a hcnlarnrlhcl'fqscknn* 

b I EIWL.E I AF I DYS K VAKT IR - . 

C LKSGSKS PSLT I PKLQKQLK- 

AGTTCTACTTCTCGGATGCAAATCTTT AT AACGATTCTTTCTTG AG AAAATT AGTTTTAA 
241 . + + . , 300 

TCAAGATCAAGAGCCTACGTTT AG AAATATTGCTAAGAAAGAACTCTTTTAATC AAAATT 

a SST3RMQI F ITILSCEN* F " 

b VLLLGCKSL* RFFLEK ISFK 

c fyfsdanlyndsflrklvlk;- 

AAACCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 ♦ * * 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a kaeske-klkhy-clnkir- 

b krraksrncnitnv-iksgn- 

C SGEQRVK I ETLLMFK * N Q V M - 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 * * . A2 0 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a CGLFYFLDHFLRS I MEKIT* 

t> EDYSIF'ITS'GALWRKLLN- 

c RIILFFRSLLKEHYCENYLI- 

TACTAAAACGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATCACTATATTAAATT 

421 * * * ♦ 480 

ATG ATTTTCCATTTGTC AAACCT AAT AAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y-KVNSLDYFPSOQCCVY'I 

b TKR'TVWI ISLANNDEYIKF- 

c L.KGXQFGLFP * PTMMS I LNS- 
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FIGURE 12 (cont.) 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 ♦ ♦ 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMS.QRISIHQTYQRQTRY 
b ICECVKGSRY I RLTKDKLAI 

c YENESKDLDTSDLPKTNSL'- 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

5 41 * * - ♦ * 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 



a 



KTQEKVC " SNSRRTYC I YYS 
b KRKKKFDNRTAEELIAFTIR- 
c NARKSLIIEQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGAGGGTGAACTCCCGAGTCTTGAGACAAT 

601 ♦ ♦ ♦ 660 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRRCTPESCDN 

b MGFITIVLG IDGELPSLETI- 

C WVLLQLF-VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 * ♦ 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a CKSCLQLKESQFCKF * C V C H 

b EKAVYNCRNRS SESSDVYAI 

c KKLFTTEG IAVLKVLMCMPL- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + ♦ ♦ + 780 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN*SQISYLNLMDSYRNK 

b ILCINLKYLISI'WIAIETN- 

C FCELI SN I LSQFNG * L *KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 * * 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNG I YVKSFGTNAH 

b QINHASLM EYT LN PLG QMHT 

c K - T y. 0 V ' W N I R * I L W D K C T L - 

TGAATTTATATTGGATTCTTAAAGC ATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 900 

ACTXMATATAACCTAAG.>ATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a CIY IGFLKHRYTECFRDCFS 

b £ r I LOS * S IDTQNALETDLA 

C N L Y W : L X A * r H R M L " R L I " L - 

TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 - 960 

AATGTTGTCTA^TCGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLISLKEA 

b YNRLPVLITLAHLLYL'KKQ- 

c TTDYLFCLLLLISY I FKRSR- 

GGCGAAATGAAAAC AAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 . <► * 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a CEMKRRLKKEISKFVDSSVT 

b AKCKED'RKRFQNLLILL'P- 

c RN EKKTK ERDFK I CC FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 * * 1080 

CCTTAATTGTTCTTCTTATAATCGTTCCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a CINfJKNiSNEKEEELSOSCF 

b ELTTR IlATKKKKSYHNPDS- 

c N"QQEY-Q2KR3RAITILIL- 
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TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 
AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 



LKISK I PGKRDTF I K I H I L 
* RFQKFQVREIHSLKF IYYS 
KDFKNSR * ERYIH * N S Y I IV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 ♦ ♦ ♦ 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

FFISQLLFSFILTIFFD*LE 
FSFHSCYFLLS * Q Y F L I S W K - 
FHFTAVI FFYLNN IFCLAGS- 



GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 



1201 



1260 



CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

VKSIK* EKR *TEVT * h IH IH 

* KVSNKRSARLR* LSLFTFI 
KKYQIREALDCGNLAYSHS * - 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 . ♦ + ♦ ♦ 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

RSTFIYPIRC*GNSSHPF-K 
DRPSYIQYDDKETAVI R F K N - 
IDLHISNTMI RKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

132 i + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

CYF-D" IFRVKKWSRNLNQK 
SAMRTKFLESRNGAEILIKK- 
VLCGLNF SQEMEPKS ' SKR- 

GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

138 1 * * 1^0 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

ELRRYCKRIEL'IFR* * V L P 
NCVDIAKESNSKSFVNKYYO- 
IASILQKNRTLNLSL IS I T N - 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 ♦ * L500 

TAGAACTAACTAACTTCTCtAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

ILIDCRD * RGNCTEDH * R N X 
SCLIEEIDEATAOKT IKEIK- 
LDCLKRLTRQLHRRS LKK " S - 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

150 1 1^60 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

VTFIN'RIN'ITNEEISDLQ 

* LLLIRE'TKLLI'RSAIFN- 
NFY'LENKLNY * YRDQRSS I - 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTCGTC AAAAT 

l561 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

LTK'KLN'S-TIKNTNLGQN 
CRNKSCTKVRQ'KIQTLVKI 
DE IKAELKLDNKKYK PWSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAACGCAATAAATAAAATGA 

1621 * 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCCTTATTTATTTTACT 

IEEGKEOQLAKEKIR0-1KC 
LRKEKKTS'QKXK'GWK-NE- 
CCRKRRPVSKRKNKA I N K M S - 
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CTACACAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 ♦ «■ ♦ - ♦ ♦ 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQKCRNKRFIFFNNLLKRGV 

b YRSEEIKDLFFSI IYCKEGF- 

c TEVKK'KIYFFQ* FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 
1741 * + 176 2 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
C G V L G F W 



FIGURE 13 



2 EVDVDNQADNHG I HSAJLKTC EE I KEAKTLYSWIQKV IRCRNQSQSH Y K.DL 51 

\:::: \.:.::\: :| |.::| - I l -M l _ 

19 ELELEMQENQNDIQVRVK. . . . IDDPKQY. . LVNVTAACLLQEGSYYQDK 62 

52 EDIKIFAQTNIVATPRDYNEEDFKVIARKEVF. STGLMIELIDKCLVELL 100 

:: :.: : -|-| h -I I ...::|.: 

63 DERRYI ITKALL . . . . EVAESDPEFICQLAVYIRNELYIRTTTNYIVAF . 107 

101 SSSDVSDRQKLQCFGFQLKGNQLAXTHLLTALSTQKQYFFQDEWNQVRAM 150 

108 CWHKNTOPFIEKYFNKAVLLPNDLLEVCEFAQVLYI 144 

151 IGNELFRHLYTK YL I FQRTS EGTLVQFCGNNVFDHLKVNDKFDKKQKGGA 200 

|::|| I .: :■ = : --I---- =:= - 

145 FDATEFKNLY LDR I LSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNE PRCCSTCKYNVK^/EKDHFLNNI^A/PNWNNMKSRTRIFYCTHF 247 

-::|| . : : | . . : - I--- -I- -I -I- I- 

182 SEFNEYQLGKYCTES . . QRKKTMFRYLSVTNKQKWDQTKKK 220 

24 8 NRNNOFFKKHEFVSNKNNISAMDRAQTIFTNIFRFNRIRKKLKDKVIEKI 2 97 

|. | | .: : : I : . . : 1 II- h II 

221 . RKENLLTKLQAI KESEDKSKRETG DIMNVEDAIKALKPAVMKKI 2 64 

298 AYMLEKVKDFNFNYYLTKSCPLPENWRERKQKIENLINKTREEKSKYYEE 347 

| - : :| - I - I I I I 

2 65 AKRQNAMK KHMKAPKI PNSTLESKYLTFKD 294 
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FIGURE 19 



1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
1 8 1 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
24 1 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
42 1 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
66 1 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
72 1 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
84 1 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca aaggacacac 
1 62 1 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggttcc gttcgtactt gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1 74 1 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg 
1801 ttacttagaa gttgatctcc Gtggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1 92 1 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
204 1 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcnaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
222 1 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
228 1 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atganctgc 
2401 aaagaacaaa aaagattaaa a 



FIGURE 20 

MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 

WIVTAACLLQEGSYYQDKJDEPvRYIITKALLEVAESDPEFICQLAVYIRNELYIRTTTN 

YIVAFCWHKNTQPFIEKYFN1CAVLLPNDLLEVCEFAQVLYIFDATEFKNLYLDRILS 

QDIRXELTFRKCLQRCVRSKJSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTK 

KKJRKENLLTKLQAIK£SEDKSKJR£TGDIMNVEDAIKALKPAVMK^ 

APKIPNSTLESKYLTFKDLIKFCHISEPKERVYKILGKKYPKTEEEYKAAFGDSASAP 

FNPELAGKRMKIEISKTWENELSAKGNTAEVWDNLISSNQLPYMAMLRNLS>fILKAGV 

SDTTHSIVINKICEPKA VEN S KMFPLQFFS AIEAVNEA VTKGFKAKKRENMNLKGQIE 

AVKEVVEKTDEEKKDMELEQTEEGEFVKVNEGIGKQYINSIELAIK1AVNKNLDEIKG 

HTAIFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGLMVKQRCEKSSFY1FSSPSSQ 

CNKCYLEVDLPGDELRPSMQiaLQEKGKLGGGTDFPYECIDEWTKNKTHVDNlVILSD 

MMIAEGYSDIWRGSSIVNSIKKYKDEVNPNIKIFAVDLEGYGKCLM.GDEFNENNYI 

KJFGMSDSILKFISAKQGGANMVEVIKNFALQKJGQK 



FIGURE 21 

1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
1 8 1 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
42 1 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
72 1 ttcacaaaca atcaaactta ctaataattc ttactagacLgttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
96 1 acaagncct tgcgaagcgt fctaatattt agttaactcc tcatcataaa ttagcgttaa 
1 02 1 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1 44 1 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1 62 1 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1 74 1 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1861 gtatactttt aagtagaacg aattttaatt taataacgtt aaaagtgcaaaaattgaatc 
1 92 1 ttccFcatta gaaagcttag -aagatartga tagtctttgc aaatctattg cttcttgtaa 
1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
2041 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
222 1 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
240 1 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
264 1 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 
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MSRRMQKKPQ API GN ETN LDF VLQN LEVYKSQIEHYKTQQQQIK 

EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQQVQLIKKVGSKVEK 

DLNLNEDENKKNGLSEQQVICEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRR 

ETDYDTEKW?EISI4DQKNYVSIYANQKTSYCWWLKDYFNK>JNYDHLNVSINRLETEAE 

FYAFDDFSQTIKLT>«fNSYQTVNIDVNFDNNLCILALLRFLLSLERFNILNIRSSYTRN 

QYNFEKIGELLETIFAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 

STDLKJLVDTNKVQDYFKJ^LQEFPPXTITVSQQAIPVSATNAVENLN\a.LKKVKHANLNL 

VSIPTQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNLKFLRLNFYTY 

VAQETSRKQILKQATTIKNLKNNKNQEETPETKDETPSESTSGMKFFDHLSELTELED 

FSWLQATQEIYDSLHKLLIRSTNLKJ<JKLSYKYEMEKSKMDTFIDLKNIYETLNNLK 

RCSVNISNPHGNISYELTNKI)STFYKFK1TLNQELQHAKYTFKQNEFQFNNVKSAKIE 

SSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNPFNKPNLLFFKQFEQLK 

NLENVSrNCILDQHlLNSISEFLEKNICKIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 

LNQVYINQQLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFDQNTVSD 

DSIKJCILESISESKYHHYLRLNPSQSSSLIKSENEEIQELLKACDEKGVLVKAYYKFP 

LC LPTGTY YBYNS D R W 
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MKILFEFIQDKJLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 

PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKIARhfEDVNNSLFCHS 

ANVNVTLLKGAAWKMFHSLVGTYAFVDLLrNYTVIQFNGQFFTQIVGNRCNEPHLPPK 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNrNSSSFFPYSKlLPSSSSIKKLTDLR 

EAIFPTNLVIGPQRLKVRJWLTLQKJLLKjmKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKJERVLKFIIVILQKLLPQEMFGSKXNKGKJIKNLNLLLSLPLNGYLPFDSLLKJCL 

RLKDFRWLFISDIWFTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VYFRHDTVVhn<LITPFIVEYFKTYLVENNVCRNH^ 

IIAIPCRGADEEEFTIYKENHKNAIQPTQKJLEYLRNKRPTSFTKIYSPTQIADRIKE 

FKQRLLKKFNNVLPELYFMKFDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFlsl 

TOTGVLKXFh^WASRVPKPYELYIDNVRTVHLSNQDVINVVEMEIFKTALWVEDKCY 

IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFFCASPSQDTLILKLADDFLIISTDQQQV 

[NIKKLAMGGFQKYNAK^NRDKJLAVSSQSDDDTVIQFCAMmFVKELEVWKHSSTMN 

NFKQRSKSSKGIFRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSIMVTQNMQFHSFLQR1IEMTVSGCPITKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKDNHLLRKEIQHLQAYIYIYIHIVN 



FIGURE 24 

Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
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Motif 0 

AKPLHWLMSVYWELLRSFFYVTETTFQKKR 
+ ~ 1 TSElEWLVLGKRSNAKMCLSDFEKRKQIFAEriYWLYHSFIIPILCSFFYITBSSDLRNR 
till LKDFR^FIsr-^WFTKHKFEHLHQUAlCFISHLPRQLIPKIIQTFFYCTEISSTVT- 
llll TRE I SW1QVET - 8 AKHP YYFDHEN - 1 YVLMKLLRWI FEDL WSLI RCFF YVTEQQKS YSK 



Motif 1 

-DOL 



v ma n LFFYRKSVWSKLQSIOIRQHLKRVQLRDVSEAEVRQHREARPALLTSRLRFIPKP— DOL 



|S« TYYYRKNIWDVIMKMSI-ADLKKETLAEVQEKEVEEWKKS-LOFAPGKLRLIPKK--TTF 



Motif 2 

RPIVNMDYWGARTFRREKRAERLTSRVKALF-8VLNYERA 

RLITN-LRKRFLIKMOSNKKMLVSTNQTLRPVASILKHLIHEESSGIPFKLEVYMKLLTF 
E8T2 RT I AI PCRGADEEEFT I YKENHKHAI QPTQKI LEYLRNKRPTS FTK I YSPTQ I ADRI KEF 

S» Spi^fnkkivhsdrkttklttntkllrshlmijctlkn-rmfkdpfgpavfnyddvmkky 



3 Motif 3 (A) 

, vvntLKHRMFGR-KKYFVRlDIKSCYDRlKQDLKFRIVKK-KLKDPEPVIRKYATIHATS 

^lkk^pe^fmkfdvkscydsiprmecmrilkd-^ 

- * - * • * *« 
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ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
G AG AG AAG CT ATTTTT C C C AC AAATTTGGTT AAAATT C CT C AG AG ACT AAAGGT ACG AAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGG C AG A CG AAG AAG AATT C AC AATTT AT AAGG AG AAT C A C AAAAATG CT AT C C AG C C 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
T G C G AAAG C C AAT AG AG AC AAAATTTT AG C CGT AAG CT C C C AAT C AG ATG ATG AT ACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AG C G CTGTT T AAC ACT AG AAT CT CTT AT AAAAC AATTG AC A C AAATTT AAATT C AAC AAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
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AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 
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GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 
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MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF 
DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 
STFPNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRK 
RTIETSITQNKSARJ<£VSWNSISISRFSIFY 

LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHR1SLSKVYNHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRJFEIILKiDLETFLKLSRYESFSLHYLMSNIKISEIEWLVL 
GKJISNAKMCLSDFEKJIKQIFAEFIYWLYTS^ 

SMKMEAFEKINENhTVRMDTQKTTLPPAVIRLLPKJCNTFRJLITNLRKRFLIKM 

LRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLIOIRMFGRKKYFVRIDIKSCYDRIKQDLMFR 

IVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLKJEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKJODAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGIIhW 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCNI 

YRLGYSMCMRAQAYLKJIMKI3IFIPQRMFITO 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRJAD 
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ggtaccgamacmccmcncataagctaattgcncctcgaacgctcctaaatctctggaaatattmacaagaactcaataacaataccaagtcaaattccaatatg 

tgttattagtgatcgataatamctatmatcggtcgttaccaagtataaggacaaaaagaacaacttccttccccctaaagactmacmanaatttacttttcaaatatatttcg 

ggttcgcnacttttaatcgtggtactgtmagctgctacttctagccaaccgcgtgtttctaccccgtcanggatatagctcuggagtagctcacagaaatccttacaaatctt 

ctgatgagactatattagattcattacagtccgtgcatattcttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttg 

aaaaggngataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatgattgagatattcaaaaatttctatccactacaa 

ctccmaacgcggtmattmctatmctattctcatgttgttccaaatatgtatcatctcgtattaggctttmccgtmactcctggaatcgtaccttmca 

aataatctaaattagmcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaagatactttgcaaaacatttattagctatcattatataaaa 

aaaatcctataattataaatattaatcaatatttgcggtcactatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgcAT 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 

CCTTAAATGATTATGTACAACTTfeTTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 

CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTA'ri'rriCTTCATTCGACTGTAGTCGGCTTCGACAGT 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTC 

ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 

ATCTACTGATGAAAGGGTTTTCCATGgtaaggtattcmttgtgaaatatttacctgcaattactgmcaaagagattgtatttaaccgataaagAA 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccggnaagatgttgcgcactttgaaca 

agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 

CAAATGACAATTACCTTCAGATTTCTGGCATACCAC rriMM AAAAATAATGTGTTTGAGGAAACTGTGT 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 

aactaatactgnatccncataactaatmagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

tattgtataaaamanaccactaacgatmaccagACCTCGAAACTTTCTTGAAATTATCGAGATACGAGTCTTTTAGT^ 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaatttttttaccattaattaacaatcagATTTCAGAAAT^ 

CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 

AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATC 1 I 1 1 I 1 II ATATCACTGAATC 

AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGACCCTTTAT 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtattttaaagtattttttgcaaaaagctaatattttcagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 

CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtatta 

cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 

AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgattcctcattattaatmgcagGCGT^ 

TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGTT 

AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 

AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaagmatttmcattggaatttmaacaaatt^ 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTC 

GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 
TAAGgtataccaattgttgaattgtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 
AGGGCTCAATTCTGTCATC 1TTI 11GTGTCATTTCTATATGGAAGATTTGATTGATGAATACCTATCGTT 
TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 
AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGgtgagttgctgt^^ 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 
ATACTTTTTTTAATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAACATGAGGTCTCTT 
ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 
ATATGGGGAAATCTTTTTTTTACAAAATTCTAA 

ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATATATAG 
GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 
AAAGAATGTTCATAACGGgtgagtacttatmaactagaaaagtcattaattaaccttagATCTTTTGAATGTTATTGGAAGAAAA 
ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 
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ATGgtacgtgtcggtctcgagacttcagcaatattgacacatcagGCTTTT^ 

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtgtcattttcaatttattatatacatcctt 

tattactggtgtcnaaacaatattattactaagtatagctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtcttt 

atccnatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaacattaaaagtaatacatgaggctaatctcctttcatttag 

aataaggaaagtggttttctataatgaataatgcccgcactaatgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatat 

acccagtgttgngaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagccccaggttatccatggtggccg 

gccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcatttaatgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaata 

gtgttaagccattattggattccgaaatagccaaamcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcctcctgamaaa 

gaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaatttmgcaaaaaagaaaatatcattgggagacatctcngatgaatcagatgcgga 

gagtatctccagcggatccttgatgtcaataacttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 
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EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FFYCTEISST VTIVYFRHDT WN KLIT V FIVE YFK-TYLVEN 40 

FFYVTEQQKS YSKTYYYRKN IWDVI-MKMS IAD LKK ETLA--EVQE 43 

KHKE GSQIFYYRKP IWKLVSKLTI VKVRIQr SEK NKQMXNNFYQ 44 

FFY.TE..K. . S * . YYYKK . IW...-KL ---F..K V. . 50 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



NVCRNHNSY- i^in™™*^. 

KEVEfiWKKSL GFAPqKP? 

KIQUIEENLE KVEEKLIPED SFQKYPi 

.E F. .<M 



K 



fRLIS] 



IMPCRG 79 

- F^HiOTFNKK 78 

i?R£ I yiTFLRK 92 

ESIB2MTF . RK 100 



EST2 pep . 

Euplotes pep 

Trans of tetrahymen 

Consensus 



ADEEBFTIYK ENHKNAIQP^ 
rVNSDRXTTK LTTWTKLLW|^ * 
DKQKNIK U^UOQILMD^^IATRls|<S- 

, ..J-.K..K LN.N. .L. .S^QL.L. .IIkN- 



;qkile\J]^nk rptsftkiys ptqiadrike 

RMFK -DPFGFAVFN 

ML-G -QXIGYSVFfi 



.IG. .VF'i 



129 
120 
130 

150 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FKQRLLKKFN 
YD-DV>Q < ispE 



-qiseSfa 



.K-- 



NVL -PRE 

EFVCKWKQVG g 
QFIEKWKNKG F 



.F. .KWK. -G 



la. 



sTFMKFD VKSCYD 
FATMD "CEKCYD 
fYVTL- 



EtfF.T.D 



. CYD 



157 
155 
158 

186 
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S- 1 : FFY VTE TTF QKN RLF FYR KS V WSK 
S-2: RQHLKRVQLRDVSEAEVRQHREA 
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-l : AKF LHW LMS VYV VEL LRS FFY VTE TTF Q 
A-2: LFF YRK SVW SKL QSI GJR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YW 
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Vector Genomic DNA Insert 



A5 



B2 91 



5825 bp Sequenced 
-2 kb Hind 111 Fragment 




Vector 



1 kb 



13 



RT Motifs 1 2 3(A) 4(B') 5(C) 6(D) 
1/ \ \ i 



1:11 LJt! !::;•; 



Introns 



2 i 



<pDNA 
5*201 




CDNA 




RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15BC 
RT-PCR w/ M2-B15 Be 
RT-PCR w/ M2-B16 



nc 
nc 
Banc 



Hind III 



500 bp 
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Polv 4 

t t c 

ta a g c c teg 
5 ' - cag acc aaa gga att cca taa gg -3 ' 
QTKGIPQG 



4(B') 



5(C) 

D D Y L L I T 

3'- ctg ctg atg gag gag tag tgg -5 

a a aaaaa a a 

t t t t 

c c 
Polv 1 
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FIGURE 36 



PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 



Ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Ea_pl23 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT 

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW 

Sc_pl03 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS -QDTLILKLADDFLIIS 

*. * 



QKVGIPQG 

caa aaa gtt ggt ate cct cag gg 

Poly 4 

t t c 

ta a g c c teg 

cag acc aaa gga att cca taa gg > 

ag acc aaa gga att cca tea ggC TCA ATT 
tc tgg ttt cct taa ggt agt ccG AGT TAA 

KGIPSGSI 



< Actual Genomic Sequence. 



CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 
GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC 

LSSFLCHFYM 



GAA GAT TTG ATT 
CTT CTA AAC TAA 

E D L I 



GAT GAA TAC CTA 
CTA CTT ATG GAT 

D E Y L 



TCG TTT ACG AAA 
AGC AAA TGC TTT 

S F T K 



AAG AAA GGA TCA 
TTC TTT CCT AGT 

K K G S 



GTG TTG TTA CGA 
CAC AAC AAT GCT 

V L L R 



GTA GTC gac gac tac etc etc ate acc 
CAT CAG ctg ctg atg gag gag tag tgg 

VVDDYLLIT 



< ctg ctg atg 

aaa 



gag 


gag 


tag 


tgg 


a a 


a a 


a 


a 


t 


t 


t 


t 


c 


c 







Poly 1 



gac gat ttc etc ttt ata aca 
D D F L F I T 



< Actual Genomic Sequence . 



# 
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3' RT PCR Strategy 



mRNA 



5' 



I AAAAAAAAAAAAAAAA . 
TTTTTTTTTTTTTTT 



1 . Synthesis of cDNA with Or Primer. 
mRNA 



I AAAAAAAAAAAAAAAA. 

I TT , iT'rrT' r ' r i r 'i' , i ,, r TT 



2. First Round PCR Using Outside Primer and Qq Primer. 




3. Second Round PCR Using inside Primer and Q| Primer. 



4. Sequence Second Round PCR Products Using Inside Primer or Q| 
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-Size Selected Libraries from P. Nurese 
3 ~ 4 kb 
5 - 6 kb 
7 - 8 kb 
11 - 12 kb 

-Libraries from J.A. Wise 
Sau 3a Partial Digest 
Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 
REP81ES Library (old) 
REP81ES Library (new) 
REP41ES Library 




-603 

■310 
• 234 
194 



Original PCR 
3' RT-PCR 




500 bp 



Mg Cone 
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• 
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5' RT PCR Strategy 



5' 



mRNA 



I AAAAAAAAAAAAAAAA 



ddA- 



-P 



1 . Synthesis of cDNA with Specific Downstream Primer. 

mRNA 




AAAAAAAAAAAAAAAA 



2. Ligate Oiigo with 5'-P and blocked 3" to cDNA using T4 RNA Ugase. 



3. First Round PCR 




4. Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p. Tezlp (429). WLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIW ...(35)... 
S.c. Est2p (366) . WLFRQLI PKI IQTFFYCTEISSTVT- Iv'YFRHDTW . . . (35) . . . 
E.a. pl23 (441). WIFEDLWSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)... 

* ***** *** 

Motif 1 Motif 2 K 
p hh h K hR h R 

S.p. Tezlp AVIRLLPKK--NTFRLITN-LRKRF ...(61)...' 
S.c. Est2p SKMRI I PKKSNNEFRI IAIPCRGAD ...(62)... 
E.a. pl23 GKLRL I PKK- - TTFRPIMTFNKKIV ...(61)... 

★ * * * * * * 

Motif 3 (A) AF 

h hDh GY h 
S.p. Tezlp KKYFVRIDIKSCYDRIKQDLMFRIVK . . . (89) . . . 
S.c. Est2p ELYFMKFDVKSCYDSIPRMECMRILK ...(75)... 

E.a. pl23 KLFFATMDIEKCYDSVNREKLSTFLK ...(107)... 

★ * * * * * 

Motif 4 (B # ) 

hPQG pP hh h 

S.p. Tezlp YLQKVG I PQG S I LS SFLCHF YMEDL I DE YLSF ...(6)... 

S.c. Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF . . . (8) . . . 

E.a. pl23 YKQTKG I PQGLCVS S I L S S F YYATLEES S LGF ...(14)... 
* * * * * * ★ 

Y Motif 5 (C) Motif 6 (D) 

h F DDhhh Gh h cK h 

S.p. Tezlp VLLRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENS .(205) 
S.c. Est2p LI LKLADDFL I 1 STDQQQVIN I KKLAMGG FQ K YnAKAMRDK I LAVS SQS .(173) 
E.a. pl23 " LLMRLTDDYLLITTQENNAVLFIEKLINVSRENGFKFMMKKLQTSFPLS . (209) 
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FIGURE 43 

Disruption strategy for the putative telomerase genes. 



wild type telomerase gene 



wild type telomerase gene 



1 . Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or uro4 marker. 
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2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 



wild type telomerase gene 
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Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 



FIGURE 44 



An Example of Confirmation of tezl disruption By PCR 
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FIGURE 45 



Tezl disruption causes progressive 
shortening of telomeres in S. pombe 
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Generations 
after speculation 
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FIGURE 46 



1 agtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 
161 ccaagtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatttacttttcaaatatatttcg 240 
241 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 
321 agctcttggagtagctcacagaaatccttacaaatcttctgatgagactatattagattcattacagtccgtgcatattc 400 
401 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
481 gttgataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 
561 attgagatattcaaaaatttctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 
641 ccaaatatgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatga 720 
721 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 
801 gatactttgcaaaacatttattagctatcattatataaaaaaaatcctataattataaatattaatcaatatttgcggtc 880 
881 actatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 
1MTEHHTPKSRILRFLENQYV 20 

1019 TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078 
21YLCTLNDYVQLVLRGSPASS 40 

lo79 TAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138 
m 41yS NICERLRSDVQTSFSIFL 60 

5l39 CAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
0 61HST VVGFDSKPDEGVQFSSP 80 



FIGURE 46 (cont) 

1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgat t tt tt tc tattcgggatagctaa ta ta tgggcag 1272 
81 K C S Q S E 86 

127 3 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 
87 L IANVVKQM F DE S F ER RRN L 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtattctaattgtgaaatat t tacctgcaattactgt t tcaaagaga 1405 
107 L M K G F S M 113 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 
114 NHEDFRAMHVNGVQN 128 

147 0 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 
129 DLVSTFPNYLI S I LESKNWQ 148 

1530 CTT TTG TTA GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC 1601 
149 L L L E I I G 155 



1602 AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 
^156 SDAMHYLLSKGS I FEALPND 175 

2662 AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG 
"17 6 NYLQISGI PLFKNNV 

Vl22 TCA AAA AAA AG A AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT 
^196 SKKRKRTI ETSITQN 

"#782 GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 
s^216 EVSWNSISISRFSIFYRSSY 235 

=f£842 AAG AAG TTT AAG CAA G gtaac taatactgttatccttcataactaatt t tag AT CTA TAT TTT AAC 
11907 

yff236 KKFKQD LYFN 245 

yQ 

SU908 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 
"246 LHSI CDR NTVHMWLQWIFPR 265 



TTT GAG GAA ACT GTG 1721 

F E E T V 195 

AAA AGC GCC CGC AAA 1781 

K S A R K 215 



1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG 
266 QFGLIN AFQV 

2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC 
286 SQSTVVPKRL 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA 
306 AKRLHRISLS 

214 8 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT 
326 DTHDDEKILS 

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT 
346 FLRSILVRVF 



AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027 

KQLHKVIPLV 285 

CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

LKVYPLIEQT 305 

AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

KVYNHYCPYI 325 

TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 

YSLKPNQVFA 345 

CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 2267 

PKLIWGNQRI 365 



2268 TTT GAG ATA ATA TTA AAA G gtattgtataaaatttattaccactaacgatt t taccag AC CTC GAA ACT 2336 
366 FEIILKD LET 375 



# # 
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2337 TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 2 396 

376 FLKLSRYESFSLHYLMSNIK 395 

2397 gtaatatgccaaatttttttaccattaattaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2465 

• 396 ISEIEWLVLG 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

406 KRSNAKMCLSDFEKRKQIFA 425 

2526 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2 585 

426 EFIYWLYNS FI I PI LQSFFY 445 

2586 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2 645 

446 ITESSDLRNRTVYFRKDIWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 27 05 

466 LLCRPFITSMKMEAFEKINE 485 

2706 gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 277 5 

^ 4 g 6 NNVRMDTQKT 495 

S 2776 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 283 5 

P 496 TLPPAVIRLLPKKNTFRLIT 515 

Ef 2836 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtattaatttttggtcatcaatgtactttacttctaatctatta 2906 

J~f 516 NLRKRFLIK 524 

^ 2907 ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 

;L 525 MGSNKKMLVSTNQTLRPV 542 

«F 2968 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 

fy 543 A S I LK H L I N E ES SG I P FN L E 562 

ij 3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 

SJ 563 V Y M K L L T F K K D L L K H R M F G 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

582 RKKYFVRIDI 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 KSC YDR I KQDLMFRIVKKKL 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 3275 

612 KDPEFVI RKYATIHATSDRA 631 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3343 

632 T K N F V S E A F S Y F 643 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 

644 DMVPFEKVVQLLSMKT 659 

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 

660 SDTLFVDFVDYWTKSSSEI F 679 

3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaattgttgaattgtaataaca 3532 

680 KMLKEHLSGHIVK 692 
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3533 ctaatgaaactag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3593 

693 IGNSQYLQKVGI PQGS 708 

3594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3 653 

709 jLSSFLCHFYMEDLIDEYLS 728 

3654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA AC A 3713 

729 FTKKKGSVLLRVVDDFLFIT 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagt tgc tg tea t tec 3777 

749 VNKKDAKKFLNLSLRG 764 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 

765 FEKHNFSTSLEKTV 778 

3841 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3900 

779 INFENSNGI INNTFFNESKK 798 

3901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3960 

799 RM PFFGFSVNMRSLDTLLAC 818 

5*3961 CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4020 

ffj 819 PKIDEALFNSTSVELTKHMG 838 

^4021 AAA TCT TTT TTT TAC AAA ATT CTA AG gta tac tg tgtaac tgaataatagc tgacaaataatcag A TCG 4089 

q 839 KSFFYKILR S 848 

H4 090 AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 414 9 

849 S L A S F A Q V F I D I T H N S K F N S 868 



^4150 TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 42 09 
J 869 CCNIYRLGYS 



- - - y R L G Y S M C M R A Q A Y L K 888 



^4210 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagtacttatt t taactaga 4274 
889 R M K D I F I P Q R M F I T D 903 

4275 aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 
904 LLNVIGRKIWKKLA 917 

4340 GAA ATA TTA jGGA TAT ACG .AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401 
918 EILGYTSRRFLSSAEVKW 935 

44 02 ggtctcgagacttcagcaatattgacacatcag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 44 68 
936 LFCLGMRDGLK 946 

4469 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4 528 
947 psFKYHPCFEQLIYQFQSLT 966 

4529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4588 
967 DLI KPLRPVLRQVLFLHRRI 986 

4589 GCT GAT TAA tgtcattttcaatttattatatacatcct t tat tac tggtgtcttaaacaatattattactaagtata 4665 
987 A D * 989 
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4666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtct 474 5 
47 4 6 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaaca 4825 
4826 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4 985 
4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagc 5065 
5066 cccaggttatccatggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 
5146 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 522 5 
522 6 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcc 5305 
5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 
5386 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 54 65 
54 66 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 



# 



FIGURE 47 



1 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 



arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 no 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val" leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 



10 



20 



180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 

thr tyr cys val arg 
ACG TAC TGC GTG CGT 



gly thr ser ala arg 
GGC ACG TCC GCA AGG 

220 

gin gly ile pro gin 
CAG GGG ATC CCG GAG 



leu cys tyr gly asp 
CTG TGC TAC GGC GAC 

250 

arg asp gly leu leu 
CGG GAC GGG CTG CTC 



thr pro his leu thr 
ACA CCT CAC CTC ACC 

280 

arg gly val pro glu 
CGA GGT GTC CCT GAG 



val val asn phe pro 
GTG GTG AAC TTC CCT 

310 

phe val gin met pro 
TTT GTT CAG ATG CCG 



leu leu asp thr arg 
CTG CTG GAT ACC CGG 

340 

tyr ala arg thr ser 
TAT GCC CGG ACC TCC 



phe lys ala gly arg 
TTC AAG GCT GGG AGG 

370 

arg leu lys cys his 
CGG CTG AAG TGT CAC 



arg tyr ala val val 
CGG TAT GCC GTG GTC 

210 

pro ser arg ala thr 
CCT TCA AGA GCC ACG 



gly ser ile leu ser 
GGC TCC ATC CTC TCC 

240 

met glu asn lys leu 
ATG GAG AAC AAG CTG 



leu arg leu val asp 
CTG CGT TTG GTG GAT 

270 

his ala lys thr phe 
CAC GCG AAA ACC TTC 



tyr gly cys val val 
TAT GGC TGC GTG GTG 

300 

val glu asp glu ala 
GTA GAA GAC GAG GCC 



ala his gly leu phe 
GCC CAC GGC CTA TTC 

330 

thr leu glu val gin 
ACC CTG GAG GTG CAG 



ile arg ala ser leu 
ATC AGA GCC AGT CTC 

360 

asn met arg arg lys 
AAC ATG CGT CGC AAA 



ser leu phe leu asp 
AGC CTG TTT CTG GAT 



200 

gin lys ala ala met 
CAG AAG GCC GCC ATG 



ser tyr val gin cys 
TCC TAC GTC CAG TGC 

230 

thr leu leu cys ser 
ACG CTG CTC TGC AGC 



phe ala gly ile arg 
TTT GCG GGG ATT" CGG 

260 

asp phe leu leu val 
GAT TTC TTG TTG GTG 



leu arg thr leu val 
CTC AGG ACC CTG GTC 

290 

asn leu arg lys thr 
AAC TTG CGG AAG ACA 



leu gly gly thr ala 
CTG GGT GGC ACG GCT 

320 

pro trp cys gly leu 
CCC TGG TGC GGC CTG 



ser asp tyr ser ser 
AGC GAC TAC TCC AGC 

350 

thr phe asn arg gly 
ACC TTC AAC CGC GGC 



leu phe gly val leu 
CTC TTT GGG GTC TTG 

380 

leu gin val asn ser 
TTG CAG GTG AAC AGC 



♦ 
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390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

400 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG ,GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 



* i 

FIGURE 48 

Motif -1 

Ep pl23 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tezl ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

HsTCPl ...YVVELLRSFFYVTETTFQKNRL... 
consensus FFY TE 



K 

Motif 0 phhh K hRh R 

Ep pl23 ...KSLGFAPGKLRLIPKXT-TFRPIMTFNKKIV... 

Sp Tezl . . .QKTTLPP A VIRLLPKKN--TFRLITNLRKRFL . . . 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRJIAIPCRGAD... 

Hs TCP1 ...ARPALLTSRLRFIPKPD--GLRPIVNMDYVVG.. 

consensus R PK R I 



AF 

Motif A h hDh GY h 

Ep pl23 ...PKLFFATMDIEKCYDSVNREKLSTFLK... 

Sp Tezl ...RKKYFVRIDIKSCYDRIKQDLMFRIVK... 

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

Hs TCP1 . ..PEL YFVKVD VTGA YDTIPQDRLTEVI A. . .//. . . 
consensus F D YD 

Motif B hPQG pS hh 

Ep pl23 ...NGKFYKQTKGIPQGLCVSSILSSFYYA... 

Sp Tezl ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD... 

Hs TCP1 ...RATSYVQCQGIPQGSILSTLLCSLCYG... 
consensus G QG S 

Y 

Motif C h FDDhhh 

Eppl23 . . . PNVNLLMRLTDD YLLITTQ ENN . . . 

SpTezl ...KKGSVLLRVVDDFLFITVNKKD... 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

HsTCPl ...RRDGLLLRLVDDFLLVTPHLTH... 
consensus DD L 

Motif D Ghh cK 

Eppl23 ...NVSRENGFKFNMKKL... 

Sp Tezl ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAKA... 

HsTCPl ...LRTLVRGVPEYGCVV... 
consensus G 



